Partially observable Szilárd engines
نویسندگان
چکیده
Leo Szilard pointed out that Maxwell's demon can be replaced by machinery, thereby laying the foundation for understanding physical nature of information. Szilard's information engine still serves as a canonical example after almost hundred years, despite recent significant growth area. The role plays reduced to mapping observable data meta-stable memory, which is utilized extract work. While showed map implemented mechanistically, it was chosen priori. choice how construct meaningful memory constitutes demon's intelligence. Recently, shown this automated well. To end, generalized, partially engines were introduced, providing basis processing. Partial observability ubiquitous in real world systems have limited sensor types and acquisition bandwidths. Generalized run work extraction at different temperature, T' > T, from forming process. This enables combined treatment heat engines. We study characteristics intelligent observers introducing model displays richness, its simplicity. A minor change - inserting divider an angle results family Their analysis shows intelligence automated. For each angle, value T'/T, optimal found, enabling with minimal dissipation. Those memories are probabilistic maps, computed algorithmically. discuss they simple system, characterize their performance, compare quality naive, deterministic quantizations observable.
منابع مشابه
Relational Partially Observable MDPs
Relational Markov Decision Processes (MDP) are a useful abstraction for stochastic planning problems since one can develop abstract solutions for them that are independent of domain size or instantiation. While there has been an increased interest in developing relational fully observable MDPs, there has been very little work on relational partially observable MDPs (POMDP), which deal with unce...
متن کاملA Partially Observable Markovian Maintenance Process with Continuous Cost Functions
In this paper a two-state Markovian maintenance process where the true state is unknown will be considered. The operating cost per period is a continuous random variable which depends on the state of the process. If investigation cost is incurred at the beginning of any period, the system wit I be returned to the "in-control" state instantaneously. This problem is solved using the average crite...
متن کاملLearning Partially Observable Action Schemas
We present an algorithm that derives actions’ effects and preconditions in partially observable, relational domains. Our algorithm has two unique features: an expressive relational language, and an exact tractable computation. An actionschema language that we present permits learning of preconditions and effects that include implicit objects and unstated relationships between objects. For examp...
متن کاملPartially observable Markov decision processes
For reinforcement learning in environments in which an agent has access to a reliable state signal, methods based on the Markov decision process (MDP) have had many successes. In many problem domains, however, an agent suffers from limited sensing capabilities that preclude it from recovering a Markovian state signal from its perceptions. Extending the MDP framework, partially observable Markov...
متن کاملLearning Partially Observable Action Models
In this paper we present tractable algorithms for learning a logical model of actions’ effects and preconditions in deterministic partially observable domains. These algorithms update a representation of the set of possible action models after every observation and action execution. We show that when actions are known to have no conditional effects, then the set of possible action models can be...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: New Journal of Physics
سال: 2022
ISSN: ['1367-2630']
DOI: https://doi.org/10.1088/1367-2630/ac6b30